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Multivariate extreme-value analysis is concerned with the extremes in a multivariate 
random sample, that is, points of which at least some components have exceptionally 
large values. Mathematical theory suggests the use of max-stable models for univariate 
and multivariate extremes. A comprehensive account is given of the various ways 
in which max-stable models are described. Furthermore, a construction device is 
proposed for generating parametric families of max-stable distributions. Although 
the device is not new, its role as a model generator seems not yet to have been fully 
exploited. 
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1. INTRODUCTION 



Multivariate extreme-value analysis is concerned with the extremes in a 
multivariate random sample, that is, points of which at least some components 
have exceptionally large values. Isolating a single component brings us back to 
univariate extreme-value theory. In this paper, the focus will rather be on the 
dependence between extremes in different components. The issue of temporal 
dependence will be ignored, so that the dependence will be understood as cross- 
sectional only. 

Mathematical theory suggests the use of max-stable models for univariate 
and multivariate extremes. The univariate margins must be one of the clas- 
sical extreme- value distributions, Frechet, Gumbel, and extreme- value Weibull, 
unified in the generalized extreme- value distributions. For the dependence struc- 
ture, however, matters are mo re complicated. A coniplete c haracterization in the 
multivariate case was given in de Haan and Resnick (1973), describing extremal 



dependence in terms of spectral measures on a subset of the unit sphere. Statis- 
tically, this formulation is not always the most practical one, and a large number 
of other concepts have been proposed. 

The aim of this paper is to provide a comprehensive account of the various 
ways in which max-stable models are described (Section [2|). Second, a construc- 
tion device is proposed for generating parametric families of max-stable distribu- 
tions (Section [3]). The device is not new as it appears already for instance in the 
theory of regularly varying multiv ariate time s eries in iBasrak and Seger 



or in the concept of a D-norm in Falk et al. ( 20101 ). Still, its role as a model 



generator seems not yet to have been fully appreciated. 

Inference on multivariate extremes via max-stable models for joint tails is 
an exciting field that is still in development. Inference methods can be non- 
parametric or parametric, and in the latter case, they can be likelihood-based, 
frequentist as well as Bayesian, or based on other techniques such as the method 
of moments or minimum distance estimation. 

Max-stable models have the drawback that they are too coarse to describe 
tails of multivariate distributions with asymptotic independence sufficiently accu- 
rately. More refined models exist and they are the topic of an ex tensive literature, 
originating from the seminal paper of iLedford and Tawnl (jl996l ^. 



2. FUNDAMENTALS 



Max-stable distributions arise out of the study of the limit behaviour of 
vectors of componentwise maxima. Their dependence structure can be described 
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via a number of dependence functions or via certain measures. All these objects 
are related and they highlight different aspects of the distribution of multivariate 
extremes. 



2.1. Convergence of vectors of componentwise maxima 



Let Xi = {Xii, . . . ,Xid), for i = 1, . . . , n, be independent and identically 
distributed random vectors in dimension d. For each component j = 1, . . . ,d, 
consider the sample maximum, and collect these maxima in a new random vector: 

M„, = (M„i , . . . , M„d) , Mnj = max{Xij ,...,Xnj). 

Observe that most of the time, the point Mn does not belong to the sample, as the 
maxima in different components will typically occur at different time instances. 
Still, the analysis of the large-sample distribution of Mn is a natural starting 
point for multivariate extreme-value theory. 

Weak convergence of a sequence of random vectors implies weak conver- 
gence of each of the components. As in univariate extreme-value theory, it is 
therefore reasonable to apply increasing affine transformations to each of the 
margins and consider the sequence of random vectors 

(2.1) (Ml^l^:j = i^...^d), n = l,2,... 



a. 



in terms of normalizing constants a„j > and For each component j = 

l,...,d, the weak limit of (M„j — bnj)/anj as n — )• oo must be a univariate 
max-stable distribution, and necessary and sufficient conditions on the marginal 
distribution Fj of the jth component Xij for s uch convergence to take plac e are 
well known, see for instance the monograph by de Haan and Ferreira ( 20061 ). 



However, weak convergence of each of the d components in (12. ip is strictly 
weaker than joint convergence of the vector of normalized maxima. What is 
needed in addition is a condition on the dependence structure of the common 
joint distribution F of the random vectors X^. A convenient way to describe this 
dependence is via the copula Ci of F, that is, 

(2.2) Pr[Xi < x]=F{x) = Ci{Fi{x^),...,Fd{xd)). 

Assuming the margins Fi, . . . ,Ffi are continuous, as we will do henceforth, the 
copula Ci of the distribution function F in ()2.2p is unique and can be obtained 
as the joint distribution function of the random vector {Fi{Xii), . . . ,Fd{Xid))- 

Weak convergence of a sequence of multivariate distribution functions to a 
limit with continuous margins is equivalent t o weak corivergence of the sequence s 
of margins and of the sequence of copulas ( Deheuvels . 1984 : Galamboi . 19871 ). 



The copula of the vector of component- wise maxima iVf„ , and hence of any vector 
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that is obtained by applying increasing transformations to each of its components, 
is given by 

(2.3) C„(«) = {Ci(nl^...,ny")r. 

This can be checked from the fact that the joint distribution function of A<f„ is 
_F" while its margins are F" for j = 1, . . . ,d. Hence, in order for the normalized 
maxima in (j2.ip to converge in distribution to a nondegenerate limit, besides 
marginal convergence, the sequence of copulas Cn must converge as well. 

The copulas that can arise as weak limits of C„ as n — t- cxd are called 
extreme-value copulas, that is, a copula C is called an extreme-value copula if 
there exists a copula Ci such that, as n — )• oo, 

(2.4) hm {Ci (n}/", . . . , u'J^) = C{ui, . . . , n,). 

Extreme-value copulas arise as the class of possible limit copulas of vectors 
Mn as n — )• OO. The copula Ci is said to be in the domain of attraction of 
C. An extensive surv e y of the literature on extreme-value copulas is given in 
Gudendorf and Segers ( 20101 ). 



The class of extreme-value copulas coincides with that of max-stable cop- 
ulas, defined as follows. A copula C is max-stable if, for all u G [0, 1]"^ and 
fc = 1,2,..., 

In the setting of componentwise maxima of independent random samples, the 
previous identity means that the copula Ck of the random vector Mk is the same 
for every sample size k. Clearly, a max-stable copula is also an extreme-value 
copula, being in its own domain of attraction. Conversely, each extreme-value 
copula can be shown to be max-stable: in (|2.4|) . partition the sample of size 
n = mk in m blocks of size k and let m tend to infinity for fixed k. Since the 
limit must not depend on k, the max-stability relation follows. 

In summary, we have found that nondegenerate limit distributions of vec- 
tors of appropriately normalized componentwise maxima have extreme- value mar- 
gins and an extreme-value or max-stable copula. Specifically, if 



Pr 



G{xi, . . . ,Xd), n^oo, 



then necessarily 



G(xi, ...,Xd) = C(Gi(xi),. . . ,Grf(xrf)) 



with extreme- value margins Gi, . . . , and an extreme- value copula G. Conver- 
gence of margins and convergence of copulas being two isolated issues, we can 
ignore the former and rather focus on the latter. In fact, the way in which the 
components are normalized is immaterial, as long as the transformations applied 
to the components are increasing. 
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2.2. Dependence functions 



Take logarithms and apply a linear expansion to see that (|2.4p is equivalent 



to 



(2.5) lim n{l - Ci(l 



n ^xi, . . . , 1 - n ^Xd)} 
= -logC(e-^\...,e-^'*) 



X G [0, oo) 



The li mit I is called the stable tail d ependence function of C, going back to lHuang 
(I1992I ) and brees and Huand (|l998l ). The variable n tending to infinity along the 
positive integers can be replaced by a variable t tending to infinity along the 
positive reals. 

The best known example is the Gumbel-Hougaar d copula, for which ^^f a:) = 
+ • • • + x^)^/^ in terms of a parameter G [1, 00] ( Gumbell . Il96d : iHougaardl . 

The function happens to be the 0-norm of the ve ctor x. The fact t hat 
is a norm is no coincidence: in a remarkable paper by iMolchanov (l2008l l. a 
characterization is g iven of all the nor ms that can give rise to stable tail depen- 
dence functions. In Falk et al. is called the D-norm of x, with D 
referring to the Pickands dependence function, see below. 

Let X = (Xi, . . . , X(i) denote a generic random vector in the original sam- 
ple. The expression on the left-hand side in ()2.5p contains the rescaled probability 



(2.6) 1-Ci(l 



xi/n, . . . , 1 - Xd/n) 
= Pr[Fi(Xi) > 



or 



or Fd{Xd)>l-Xd/n]. 



This probability concerns the event that at least one among the d components 
Xi, . . . ,Xd should exceed a high percentile of its own distribution. The copula 
domain-of-attraction condition (|2.4p . originally involving the vector of compo- 
nentwise sample maxima, has been replaced by a condition on the upper tail of a 
single random vector. This is akin to the familiar peaks-over-threshold approach 
in univariate extreme-value theory. 



The tail copula, i?, of Schmidt and Stadtmiiller ( 20061 ) arises if all d com- 
ponents are required to exceed a large percentile simultaneously: 

lim nPr[Fi(Xi) > 1 — xi/n and . . . and Fd{Xd) > 1 — x^/n] 

n— >cxD 

= R{x), a;G[0,oo)'^. 

Clearly, the relation between the functions I and R is governed by the inclusion- 
exclusion formula. In higher dimensions, (. is somewhat more convenient than 
-R, as setting some components Xj in the definition of i to zero allows one to 
retrieve the lower-dimensional margins of the extreme-value copula. This is not 
possible for the tail copula R, as setting even a single Xj to zero immediately 
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Figure 1' 

Left: the stable tail dependence function £ - at least one compo- 
nent should be large. Right: the tail copula R ~ all components 
must be large simultaneously. Here Uj — Fj{Xj) for j = 1,2. 



yields R{x) = 0. The difference between the two functions i and R is depicted 
in Figure [TJ 

The dependence functions i and R are homogeneous: 

i{ax) = lim t {l - Ci(l - xi/{t/a), . . . ,1 - Xd/it/a))} 
= lim a s \l — Ci{l — xi/s, . . . ,1 — Xd/s)} 
(2.7) =ae{x), a > 0, ic G [0, oo)'', 

and similarly for R. It is therefore sufficient to consider the restriction of these 
functions to the unit simplex Ad-i = {{wi, . . . , Wd) ^ [0, 1]'^ : wi + ■ ■ ■ + Wd = 1}. 
The restr i ction of i to Ad-i is called the Pickands dependence function, D, after 



Pickandsl (Il98ll ). By homogeneity, 

i{x) = {xi-\ ^Xd)D{wi,...,Wd), Wj = ■ — ^— . 

xi-\ \-Xd 

Frequently, the variable Wd = 1 — wi — ---— Wd-i is suppressed from the notation 
and D is written as a function of the arguments {wi, . . . , Wd-i) only. 

The probability on the right-hand side of (|2.6p involves the union of the 
events {Fj{Xj) > 1 — xj/n}, each of which has probability Xj/n, provided < 
Xj < n. As a consequence, we have the elementary bounds 

max(xi/n, . . . , Xd/n) 

<Pr[Fi(Xi) > l-xi/n or ... or Fd{Xd) > 1 - Xd/n] 

< xi/n + • • • + Xd/n. 
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Multiplying by n and letting n tend to infinity, we obtain 

(2.8) m.ax{xi, . . . ,Xd) < iixi, . . . ,Xd) < Xl -\ + Xd, xe[0,oo)'^. 

By (j2.5p and (j2.8p . an extreme- value copula C must satisfy 

(2.9) ui---Ud<C{ui,...,Ud) <max{ui,...,Un), it G [0, l]*^. 

The lower and upper bounds in the two previous displays can be attained, 
corresponding to the extreme cases of independence and perfect association. 
In particular, ma x -stab le models are positive quadrant dependent. In fact, in 
Garralda-Guillem (|2000l '). the stronger property is shown that bivariate extreme- 



value copulas are monotone regression dependent; see also Theorem 5.2.10 in 
Resnickl (| 19871 ). 



By ()2.5p . the copula C can be given in terms of the tail dependence function 
through 

C(ni, ...,Ud)= exp{-^(-logui, . . . ,-logUrf)}, u G (0, l]"*. 

In extreme-value theory, it is often convenient to standardize to other distribu- 
tions than the uniform (0, 1) law. The three most common forms are the unit 
Frechet distribution, the Gumbel distribution, and the reverse exponential dis- 
tribution, yielding respectively 

(2.10) C(e-i/"S...,e-^/"'') = exp{-£(l/xi,...,l/xrf)}, a;e(0,oo)^ 

(2.11) C(e-'^"'S...,e-<^""') = exp{-£(e-^i,...,e-^'*)}, x € 

(2.12) C(e"\...,e"'') =exp{-£(-xi,...,-Xrf)}, a; e (-oo, 0)"^. 

When using unit Frechet margins, the notation V{xi, . . . , Xd) = i{l/xi, . . . , l/xd) 
is often employed too. 



2.3. The intensity measure 



The transformation of the components Xj to uniform (0, 1) random vari- 
ables via the probability integral transform Fj{Xj) has the disadvantage that 
all the action regarding the upper extremes is compressed to the neighbourhood 
of 1. Instead, for a univariate sequence ^1,^2, •• • of independent and identically 
distributed random variables with common distribution function F^, define the 
first exceedance time of the level x by 

T(x) = mf{i = 1,2,... : x}. 

If F^{x) < 1, then T{x) will be a geometric random variable with success proba- 
bility equal to 1 — F^{x). Its expectation. 
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Gumbel copula, theta = 2: uniform margins 



Gumbel copula, theta = 2: Pareto margins 
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Figure 2: 



Two views on a sample: uniform (left) versus Pareto (right) 



is called the return time of the level x. 



Now let us apply this return time transformation to each of the d compo- 
nents of the random vector (Xi, . . . ,Xci)- The return time of observation Xj is 
Yj = 1/{1 — Fj{Xj)}. The law of Yj is unit Pareto rather than uniform on (0, 1), 
as Fr[Yj > y] = Fr[Fj{Xj) > 1 - 1/y] = l/y for y > 1. We find that values of Xj 
corresponding to high percentiles of Fj are mapped to large values of Yj. As is 
evident from Figure [21 extremes are magnified. 

Now suppose that the copula Ci is in the domain of attraction of an 
extreme- value copula with stable tail dependence function i. Equation ()2.5p says 
that the random vector Y = (Yi, . . . , Y^) satisfies 



t{l - Ci{l - xi/t, . . . , 1 - Xd/t)} = t Pr [j{Yj > t/xj} 

Lj=i -I 

= t Pt[y /t e {[o,^f \[o,i/x])] 

—7- i{x), t — )■ OO. 

It follows that on the space = [0, oo]'^ \ {0}, there exists a measure fi such that 



(2.13) 



E 



n £ 



1=1 



n Pr[l^/n G • ] A /i( • 



n 



oo. 



The limit takes place in the mode of vague convergence of measures, meaning that 
linin^oonElfiY /n)] = J^^ f{x) dfj.{x) for every bounded, continuous function / 
on Erf that vanishes in a neighbourhood of the origin. Intuitively, when n grows 
large, the vector Y /n is pulled towards the origin, in the neighbourhood of which 
the function / is zero. The intensity measure then only concerns the upper tail of 
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the distribution of Y. The first expression in the previous display shows that ^ 
expresses the hmiting average intensity in space of the normahzed sample cloud 
{Yi/n, . . . ,Yn/n}. The stable tail dependence function acts as a distribution 
function for the intensity measure /x, as we have 

^(a;) = ^([0,oo]'^\[0,l/^]). 



2.4. Extreme profiles 



Assume that the vector of return times Y = (Yi, . . . , 1^) with Yj = 1/{1 — 
Fj{Xj)} is large, that is, at least one of its components exceeds a high threshold. 
The relative sizes of the d components then inform us about the extremal depen- 
dence: are some components large simultaneously or is one specific component 
dominating all the other ones? Specifically, for y G [0, c«)'^ \ {0} put 

r{y) = yi-\ hyd G (0,oo), 

wiy) = {yj/r{y) j = l,. . . ,d) g Aa-i, 

to be thought of as the magnitude and the profile of the vector y, respectively. 
The spectral measure H lives on the unit simplex and is defined by 

H{B) = fi{{y : r{y) > I, w{y) € B}), 

for Borel subsets B of Aa^i. 

The measure convergence in (|2.13p implies that 



(2.14) E 



Y,Hr{Yi)>n, w(Yi) G •} 



i=l 



n Fv[r{Y) > n, w{Y) £ ■] ^ H{-), n oo. 



with vague convergence being replaced by weak convergence because the state 
space Arf_i is compact. In particular, 

(2.15) Fv[w{Y) G • I r{Y) > t] A = Q( • ), t ^ oo, 

meaning that the conditional distribution of the profile w{Y) given that the 
magnitude r{Y) is large converges in distribution to the normalized spectral 
measure Q. 

The spectral measure H and the profile distribution Q{-) are alternative, 
equivalent ways of describing the extreme- value copula C. Indeed, homogeneity 
of i in (j2.7p implies homogeneity of fi: 

(2.16) ^(a-) = a"V(-)> a > 0. 
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As a consequence, the intensity measure satisfies 

(2.17) n{{y : r{y) > z, w{y) G B}) = z'^ fi{{y \r>l,w£B}) 

= z-^H{B) 

for z > and for Borel sets B of Arf_i. That is, when expressing a point y in 
the coordinates (r, w), the intensity measure /x factorizes into a product measure 
on (0, oo) X Arf_i given by r~'^dr Hidw). Equation (|2.17p leads to 

f{y)fi{dy)= / / f{rw)r-^drH{dw) 
JAa-i Jo 

for /i-integrable functions /, showing how to recover /x and thus £ and C from 
H. The special case where / is equal to the indicator function of the set {y : 
maxj Xj-yj > 1} for some x G [0, oo)'' yields, after some computation, 

(2.19) i{x)= / max {wjXj)H{dw). 

Incidentally, this representation of i implies that i must be convex. By special- 
izing the bounds in (j2.8|) to the unit vectors in M,'^, one finds that the spectral 
measure H must satisfy the constraints 

(2.20) 1 = e{ej) = / Wj H{dw), j = l,...,d. 
It follows that the total mass of H is equal to 

H{Ad^i) = j\wi + --- + Wd) H{dw) = d. 

Thanks to this property, it is possible to recover the spectral measure H from 
the profile distribution Q. From ()2.20p . it then follows that a random vector 
W = iWi, . . . , Wd) on A(i_i with law equal to Q must satisfy 

(2.21) EQ[ty,]= / w,Q{dw) = l/d, j = l,...,d. 

In Section [3l we will see that any such law Q can appear as the profile distribution 
of a (i-variate max-stable distribution. 



In case of asymptotic independence, £{x) = xi + • • • + Xd, the profile dis- 
tribution Q is equal to the discrete uniform distribution on the d vertices of 
Ad-i: asymptotically, only one component can be large at a time. In the case of 
asymptotic perfect dependence, £{x) = max(3;i, . . . ,Xrf), the profile distribution 
Q is degenerate at the center (l/d, . . . , l/d) of Ad-i'. all components are equally 
large. These two extreme cases are depicted in Figure El 

To show the ease with which coefficients related to extremal dependence 
can be computed, consider the random variable 

d 

N{t) = Y,I{FjiX,)>l-l/t}, 
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Figure 3: 



Asymptotic independence (left) versus asymptotic perfect de- 
pendence (right). 



counting the number of components that exceed a high percentile. The following 
dependence coefficients have natural interpretations. 



Trouble is in the air: 

lim t PT[N{t) > 1] = ...,!)= / max(u'i, ...,Wd) H{dw). 



Multiple failures: 



lim t Fr[N{t) > k] 



Wr 



-k+1) H{dw), 



for k = l,...,d, where < ■■■ < w^^^-^ denote the order statistics of 
{wi, . . .,Wd). 

The sky is falling: 

limtFr[N{t) = d]=R{l,...,l)= / mm{wi, . . . ,Wd) H{dw). 



System collapse - how bad will it get? 

I{w{i) H ^W(^d-k))H{dw) 



lim E[N{t) - k \ N{t) > k] 



Jw(^d-k+i)H{dw) 



for fc = 1, . . . , d — 1. 
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CONSTRUCTING MODELS 



There is a fairly large nu mber of parametric max-stab le m odels available; 
see fo r instance the overviews in Kotz and Nadarajah ( 200d ) and Beirlant et al 
( 2OO4I ) ■ In a search for flexi ble models in large di n iensio n s, new families are 



still b e ing construct e d, as in iBallani and Schlathed (1201 Ih . iBoldi and Davison 
(l2007l ). ICoolev et all (|20ld ') and iFoug eres et alJ iwm . .n this section, a simple 
construction device will be proposed and illustrated. 

From Section [21 we recall that max-stable models for extremal dependence 
can be represented either via the extreme-value copula C, the stable tail depen- 
dence function £, the intensity measure /i, or the spectral measure H and its 
normalized version, the profile distribution Q. However, as these objects must 
satisfy certain constraints, construction of parametric models is not obvious, par- 
ticularly in high dimensions. Even if flexible parametric forms can be found, 
interpretation of the model parameters may not be obvious. In addition, when 
working with the spectral measure or profile distribution, the passage to lower- 
dimensional margins can be awkward, as the conditioning events in ()2.15p will be 
different according to which components are selected. 



3.1. A construction device 



Let Z he a unit Frechet random variable, that is, Pr(Z < z) = exp(— l/z) 
for z > 0. Let A = {Ai, . . . , A^) be a random vector, independent of Z, such that 
< E[max(j4j, 0)] < cxd for every j E {1, . . . ,d}. Consider the random vector 

(3.1) X = {Xi,...,Xa) = {AiZ,...,AaZ). 

The max-stable attractor of the distribution of X can be explicitly calculated. 

Lemma 3.1. For x = (xi, . . . , x^) G (0, 00)'^, we have 

(3.2) lim Pr[Xi <nxi,...,Xd< nxdT 

n— >oo 

= exp{- E[max(^i/xi, . . . , Ad/xd, 0)]}. 



Proof: Let x G (0, 00)'^. We have 

Pr[Xi < XI, . . . < Xrf] = Pr[^iZ < xi, . . . < Xrf] 

= Pr[^i/xi < 1/Z, . . . , Ad/xd < l/Z] 
= Pr[l/Z > max(^i/xi, . . . , Ad/xd)] 

The distribution of l/Z is unit exponential. Since A and Z are independent, 

Pr[Xi < xi,. . . ,Xd<Xd \ Al,. . . ,Ad]= exp{-max(^i/xi, . . .,^^7x^,0)}. 
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It follows that 

Pr[Xi < xi,. . . ,Xd < Xd] = E[exp{-max(^i/xi, . . . ,Ad/xd,0)}]. 

Let ^(i),yl(2),... be a sequence of independent and identically distributed 
copies of A. Fix positive integer n. We have 

Pr[Xi <nxi,...,Xd< nxdr 
= (E[exp{-max(^i/a;i, . . . , vlrf/x^, 0)}])" 



E 



f 1 " V 
exp<^ ^max(4'Va^i,---,4'V3;d,0) \ 



Equation (j3.2p now follows by the law of large numbers and the dominated con- 
vergence theorem. □ 



The margins of the limit distribution function, say G, in (I3.2p are equal 
to Gj{xj) = exp{— E[max(ylj, 0)]/xj} for Xj > 0. Assume that E[max(^j,0)] = 
1 for all J = 1 , . . . , d; this can always be achieved by rescaling the variables 
Aj if necessary. In that case, the margins of G are unit Frechet. Comparing 
equations (13. 2j) and (12.10p . we find that the stable tail dependence function of G 
is given by 

(3.3) iA{xi, ...,Xd) = E[max(xi^i, . . . , XdAd, 0)] 



The spectral measure H corresponding to i in (]3.3p can be identified too. 

Write Aj = max(Aj,0) and put R = Af ^ h ^j. On the event R > 0, 

define Wj = Aj / R; on the event R = 0, the definition of Wj is immaterial - for 
definiteness, put Wj = l/d if = 0. Note that W takes values in A^-i and that 
E[R] = Yfj=i E[^+] = d. We have Aj = RWj and thus 

^A{xl, ...,Xd) = E[R max(VFia;i, . . . ,WdXd)] 

= E[E[R \Wi,...,Wd] max(TyiXi, . . . , WdXd)] ■ 

Comparing this expression with (12.19p . we find that the spectral measure H of 
£a is given by 

(3.4) H{dw) = E[R\W = w]Pt[W £ dw], 

that is, H is absolutely continuous with respect to the law of W with Radon- 
Nikodym derivative equal to E[R \ W = w]. Similarly, the profile distribution Q 
satisfies 

(3.5) Qidw) = E[R\W = w] Pv[W G dw], 

Intuitively, this makes sense: profiles W that on average yield larger values of R 
will have a larger contribution to the joint extremes of X . 
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Incidentally, this construction shows that any probability distribution Q 
on Aii_i satisfying ()2.2ip can appear as the profile distribution of a d-variate 
max-stable distribution. Indeed, let the random vector W on A^_i have law Q 
and put Aj = dWj for all j = 1, . . . , d. As Ai + + Ad = dhy construction, the 
law of the random vector X in p.ip is in the domain of attraction of a d-variate 
max-stable distribution with profile distribution equal to Q. 

If the dimension d is large, realistic models of extremal dependence should 
allow for the possibility that only some but not all components of a random 
vector are large simultaneously. In terms of the spectral measure or the profile 
distribution, this is encoded by the lower-dimensional faces of the unit simplex. 
For a non-empty subset / of {1, . . . , d}, let Ad-ij denote the set of all w in A^-i 
such that Wj > if j G / and Wj = otherwise. If the probability of the event 
{miiiji^j Aj > > maxj^jc Aj} is non-zero, then by ()3.4p . the spectral measure 
and the profile distribution will put positive mass on A^-ij. The set / contains 
the indices of the components that are large. 



3.2. Examples 



The simplicity of (13. Sh is appealing and the reader is invited to the apply 
the recipe in order to produce his or her own parametric models. In the remainder 
of the article, a number of well-known examples are worked out. 

Example 3.1 (Independence). Suppose that Pr[^j > > max{Ai : i ^ 
j)] = pj for pj > and pi + - • • +Pd = 1- Then only component can be large at the 
time. After standardization ensuring that E[max(Aj, 0)] = 1 for all j = 1, . . . ,d, 
we find Ia^xi, . . . ,Xd) = E[max(xiAi, . . . ,XdAd,0)] = xi + ■ ■ ■ + Xd, the stable 
tail dependence function of the independence copula. 

Example 3.2 (Perfect dependence). Suppose that j4j = a^i? with prob- 
ability one for all j = 1, . . . ,d, for some constants aj > and a random variable 
B such that E[max(i?,0)] is positive and finite. Then the profile of an extreme 
vector is fixed. After standardization, aj = 1/E[max(i?,0)], the stable tail de- 
pendence function is that of perfect positive association, that is, £a{xi, ■ ■ ■ , Xd) = 
max(3;i, ...,Xd). 

Example 3.3 (Discrete spectral measures). Suppose that the distribu- 
tion of A is discrete with a finite number of atoms. Specifically, suppose that 
Pr[A = Ofc] = pk for k G {1, . . . ,rn}, where G H'^ and pk G (0, 1) such that 
SfcLiPfc = 1- Via standardization, ensure that 1 = E[max(74j, 0)] = X^feLi Pfe'^fcj 
for ah j = 1, . . . , d. Put = a^i H + and write at = VkWk: if > 0, 
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then Wkj = a'^j/fk, whereas if = 0, then put Wkj = l/d. It follows that 

m 

(Aixi,..., Xd) = ^Pk max(a^^xi, . . . , a^^x^) 
k=i 

m 

= y^(pfcrfc)max(wfcixi, . . . ,WkdXd)- 



k=l 



We find that the spectral measure H and the profile distribution Q are discrete 
and are given by 

m m 

fc=i fc=i 

with 5w a unit point mass at w. The probabilities pt are tilted with the mag- 
nitudes rfc, giving higher prominence to profiles that are associated to larger 
values of r^. 

Max-stable models with discrete s pectral measures are called extreme- value 
factor models in Einmahl et al. I drnil). Each of the m possible outcomes 
results in a different profile, according to the type of event or factor that triggered 
the extreme value. 



Example 3.4 (Random indicators). Let 

y) = E[max(x^, yB, 0)], (x, y) G [0, oo)2, 

in terms of random variables A and B such that E[max(yl, 0)] = E[max(B, 0)] = 1. 
Let (/, J) be a pair of random indicators, independent of the pair {A,B), such 
that Pr[I = 1] = p, Pr[J = 1] = g, and Pr[/ = J = 1] = r. In the definition of 
replace the pair {A,B) by the pair {p~^IA,q^^JB); we assume that < p < 1 
and < (7 < 1. The new stable tail dependence function is equal to 

E[max(p~^x/A, q'^yJB, 0)] 
= E[max{p-^xA, q'^yB)] Pr[/ = J = 1] 

+ E[max(p^^x^, 0)] Pr[/ = 1, J = 0] E[max(g"^yB, 0)] Pr[/ = 0, J = 1] 
= ^(j)~^x, q^'^y) r + p~^x {p — r) + q^^y {q — r) 

Writing a = r/p = Pr[J = 1 | / = 1] and /3 = r/q = Pr[/ = 1 | J = 1], we find 

(3.6) 4,/3(x,y) =^(ax,/3y) + (l-a)x + (1-/3)2/, (x,y) e [0,oo)2. 

The new tail copula is simply 

Ra,/s{x, y) = x + y - la,fs{x, y) 

= ax + f3y — £{ax, j3y) = R{ax, /3y), (x, y) G [0, oo)^. 



Max-stable models 



17 



This is an asymmetric, two-parameter extension of the original model. Imposing 
the equality constraint a = /S = 9 G [0, 1] yields the symmetric, one-parameter 
extension 

(3.7) £e{x,y) = e£{x,y) + {l-d){x + y). 



In higher dimensions, a vector of indicators (Ii, . . . , Id) can serve to switch 
some components Xj 'on' or 'off'. The dependence structure in these indicators 
then yields an extremal dependence structure for the random vector X. Specif- 
ically, let pj = Pr[/j = 1]; we assume < pj < 1. Let A = {Ai, . . . , A^) be a 
random vector independent of (Ii, . . . , Id) and such that E[max(74j, 0)] = 1 for 
all j = 1, . . . ,d. Then we can define a stable tail dependence function via 

(3.8) ipixi, ...,Xd)= E[max(p^^j;i/i^i, . . . ,p~^XdIdAd, 0)] 

= X] P^'^') E[max{p~'^XjAj : j G c)] 

0^cC{l,...,ci} 

where p{c) = Pr[{j = 1, . . . ,d : Ij = 1} = c]. Keeping the law of A fixed, the 
parameter vector p is equal to a probability distribution (p(c))c on the non-empty 
subsets c of {1, ... , d}. 

In this way, hierarchical structures can easily be built. For instance, in 
dimension d = 4, we can think of {Ii, I2, 13, 14) = (J^ K, K). One can also 
think of logit-type models for the indicators. 



Example 3.5 (Marshall-Olkin copula). Applying the device in (|3.6p to 
the function i{x, y) = max(x, y) yields the model 

ia,i3{x, y) = max(aj;, (3y) + (1 — a)x -|- (1 — (3)y = x + y — m.\n{ax^ (3y). 

The extreme-value copula associated to la,i3 is the Marshall-Olkin copula 

Ca,i3iu,v) = uv mm.{u~'^ ,v~^) = mm.{u^~°'v, uv^~^), {u,v) £ [0,1]^. 

In higher dimensions, applying the device (|3.8p to the function i(x) = max(a;), 
that is, Aj = 1 with probability one, we get the model 

ip{xi, ...,Xd)= ^ p{c) max(p~^Xj : j G c), 
the spectral measure of which is discrete. Anot her stochastic interpretation of 



this model is provided in lEmbrechts et al.l (|2003l ). 



Example 3.6 (Dirichlet model). Let ai, . . . ,ad be positive parameters 
and let Zi, . . . , Zd be independent Gamma(aj, 1) random variables, that is, the 
density of Zj is 

fj{z) = -^z^^-'e-\ 0<z<oc. 
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Put Aj = ^Zj, a positive random variable with unit expectation. The stable 
tail dependence function associated with the random vector A = {Ai, . . . , A^) is 

iA{xi, . . . ,Xd) = E[max(xi^i, . . . ,XdAd)] 

= E[max(Qj'^xiZi, . . . , a^^x^Zd)] 

= E[(Zi H \- Zd) inayi{a^^xiVi, . . .,a^^XdVd)], 

where Vj = Zj/(Zi + • • • + Zd). It is well known that the random vector 
{Vi, . . . , Vd) is independent oi Zi + ■ ■ ■ + Zd and has a Dirichlet distribution with 
parameters (ai, . . . , ad)- We find 

(3.9) iA{x) = E[(ai H h a^) max(aj;^2;il/i, . . . , a'^^XdVd)] 

r(E ■=! + 1) f f -1 ,TJ -.-1 . 

= nl. Via.) .^T".>^- "^"^-^ n . . . d.._. 



We recognize the Dirichlet model introduced in Coles and Tawnl (1991). 



Example 3.7 (Polynomial Pickands functions). In the Dirichlet model 
3.9p . put d = 2 and ai = Q2 = 1, to obtain 



i{x,y) 



2 / m.aLX.{xv,y{l — v)} dv = X + y 
Jo 



xy 
x + y' 



The Pickands dependence function associated to £ is D{t) = £{l—t, t) = 1— t(l— t) 
for t G [0, 1]. Applying the transformation in (13. 7p yields the mixture model 

D{t) = l-et{l-t), t£[0,l], 

in terms of a parameter 9 G [0, 1], introduced in Tawnl ( 19881 ). This is the only 
model in which D is a quadratic polynomial. Applying the transformation in 
p.6p yields the rational model 

Da,^(.t) = e{a{l - t),pt) + (1 - a)(l - t) + (1 - /3)t 
a/3t(l - t) 



a(l -t) +/3t' 
in terms of parameters a, /? G (0, 1]. 



t G [0,1], 



More generally, bivariate max-stable models of which the Pickands depen- 
dence function is a higher-order polynomial can be obtained via the transforma- 
tion in ()3.7p applied to the function i(x, y) = E[max(x^, yB)] when A and B are 
random sums 

A = Ei + ■■■ + £,;, B = Fi + --- + Fk, 

in terms of independent random variables J, K, Ei, Fi, E2, F2, . . . such that J 
and K are counting random variables having finite support and unit expectation 
and where Ei, Fi, E2, F2, . . . are unit expone ntial random variables . Poly nomial 
Pickands dependence functions are studied in lOuillotte and PerronI toii ). 
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Example 3.8 (Schlather model). Let {S,T) be a bivariate normal ran- 
dom vector with standard normal margins and with correlation p £ (—1,1). Put 
A = V27rS and B = V27rT, so that E[max(^, 0)] = E[max(5, 0)] = 1. The stable 
tail dependence function corresponding to the random pair (A, B) is 

tp{x,y) = E[max(xAy5,0)] 

/ V2^2/t-Le-(^'+2p^*+*')/2dsdt 
_=-ooJt=o 27r 

f'oc f'O -I 

+ / / ^/2^xs— e-(^'+2^^*+*')/2dsdi 
Js=o Jt=-oo 27r 

roo roc -| 

+ / / \/2^max(xs,yt) — e-(^'+2'^^*+*')/2dsdt. 
Jo Jo 27r 

After some tedious calculations, this can be shown to be 



ipix, y) = hx + y)(l + Jl-2{p + l) 



X + y)"^ J ' 



Schlatheil ^200^ ). The Pickands dependence function corresponding to this 



see 
model is 

1 



Dp{t) = -{1 + - 2(/9 + l)t(l - t)}, tG[0,l]. 



Example 3.9 (Hiisler-Reiss model). Let (5", T) be a bivariate normal 
random vector with standard normal margins and with correlation p € (—1,1). 
Let fj > and put 

^ = exp(o-5-o-V2), B = exp(o-r-crV2). 

The pair (A, B) is lognormal with unit expectations, and it yields the stable tail 
dependence function 

/OQ fOQ -I 
/ max{xe"^ ye"*}e-"'/2 J_e~is'+2pst+t^)/2 
-OD J —OO ^'^ 

The double integral can be calculated in terms of the standard normal cumulative 
distribution function yielding 

L{x,y) = x$( ^ + ilog(x/y) ) +y^{'^ + -\og{y/x) 
\ 2 a J \ 2 a 



with p arameter a = ay^2{l — p). This is the model introduced in lHiisler and Reiss 

diQsa ). 
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